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ABSTBACT 

A Monte Carlo investigation of siK roi>ust correlation 
estimators was conducted for data from distributions tfith longer than 
Gaussian tails? a bisquare coefficient, the Tukey correlation, the 
standardized sums and differences, a biweight standardized sums and 
differences, the transformed Spearman's rho and a bivariata trimaed 
Pearson*. Evaluation of the estimators was based on bias and 
variability as measured by mean squs-:^ error, and efficiency relative 
to the Pearson correlation coefficit; Correlations of .9, .3^ .6, 
and -0 for samples of size twenty and forty for different weight 
tails were used. Two estimators that stood out as not being sitcnatioo 
specific and demonstrated robust properties were the averaged 
bisquare, and Tukey correlations. The transformed Spearman rtAD was 
easy to calculate, and for small correlations was clearly the best of 
all the estimators considered; however, it degenerated as the 
correlation increased so that it was among the worst for a 
correlation of .9. The standardized sums and differences estimator 
demonstrated robustness and warrants further investigation. The 
biweight standardized sums and differences estimator did nDt 
demonstrate robustness, aiid the bivariate trimming estimator 
demonstrated limited usefulness. (Author/RL) 
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ABSTRACT 



A Honte carlo investigation of six rcrust zzrrelation 
estimators was conducted for data from distri :rions with 
longer than Gaussian tailn: a bisquare coefficient^ the 
Tufcey correlation^ the standardized sums and di:^f erences<r a 
biveight standardized sums and differences^ the transformed 
Spearman's rho and a bivariate triniBed Pearson- Evaluation 
of the estimators was based on bias and variability as 
measured by mean square error^ and efficiency relative to 
the P€iarson correlation coefficient. Correlations of -.9^ 
.3, 4.6^ and .0 for samples of size twenty and forty for 
different weight tails were used. ^ . 
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T^, XMTRG:ZIEZ!!30H ■ 

Dc 3g the nusrx tiia decades^^ reseairchers have been 
investii^rriag r.fae Tsiility aad ^ficiency of least squares 
es-timarrjrs when the a^snira prion of "a Gaassism distribution is 
Yiolatsi and have clearly demcmstrated the inadequacy of 
such ^stimartors, inclisdiiig iihe Pearsoji product-moment 
correJstioa cc^efficient r for distrzibutions mith longer than 
Gaussian tails. I aew z:.^ ass of esr ifflat07:s , called resistant 
and robust^ has been pixrpxjsed to eFrtimate paxaioeters of long 
tailed distribi^tions (^EHinc^el, 197;:; Hogg^ 1967; Huber, 
1972; Hainer^ 1975). H dastinct±on between resistance and 
robustness is not always ciS^ear in the literature and the 
term robust is often used to describe both properties* 
Resistance refers to the ability of a sunraary statistic to 
remain constant in view of a change in a small part of the 
data^ no matter what part or how substantial the change. 
The term robustness does not have singular meaning and in 
the context of this paper refers to the variability of the 
estimator. 

The resistant and robust estimation of parameters 
involves the transformation of the observations to some 
other values such as ranks and/or the differential treatment 
of the observations such as the trimming of extreme 
observations or the weighting of all observations by some 
scheme. Resistant and robust location and regression 
estimators have been studied extensively (Andrews, Bickel, 



Harapel, Huber^ Rogers and Tukey, 1972; Andrews^ 1974; 
Gross., 1976r 1977; Huber, 1977; Maronna, 1976; Ramsay, 
1977)/ but the investigation of such correlation estimators 
is limited (Denby and Larson, 1977; Devlin, Gnanadesikan 
and Kettenring, 1975; Gnanadesikan and Kettenring, 1972; 
Hainer and Thissen, 1976) • Unlike the least squares 
solution, there is not a unigue relationship betveea 
correlation and regression for most robust and resistant 
estimators. The literatures are therefore separate and 
distinct- 

The criteria for the treatment of observations for the 
estimation of the correlation coefficient laust take into 
account the position/value of an observation vithin the 
respective univariate distributions as well as the bivariate 
distribution. While no one estimator has demonstrated 
overall superiority, the more promising correlation 
estimators involve trimming in the univariate context of the 
X and Y distributions and the bivariate context of the major 
and minor axes« 

This study compares six of the more promising robust 
correlation estimators* Mean sguare error and efficiency 
relative to the Pearson are used to evaluate the estimators. 
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2. ROBOST CORBELATIOH ESTIMATION 

One of the first papers to give an overview of 1 .m: 
problems of multivariate estiroation is: Gnacadesikan a 
Kettenring (1972)*. They base the estiaation of correlatfLon: 
on the quarter square identity 

cov(X^Y) = I/a [varCX «- Y) - var{X - Ij] 2.1 

and using the usual standard deviations s{X) and s(Y) obtain 
the Pearson correlation coef ficient 

r(P) cov(X,X)/s(X) s(Y). 2-2 

Standardized Sums and Differences - r*CSSr) 

Robust correlation estimators can be obtained by usiaq 
any robust estimators of scale to calculate estimates of the 
va/riaaces and standard deviations and using them in 
equations 2-1 and 2-2^ 

A problem with this procedure is that the robust 
correlation estimates may not lie within the bounds [-1# 
+ 1 ]. The solution is to standardize the univariate 
distributions first, using robust estimators of location and 
scale, t* and s*, respectively* Let 

X = (X-t*(X) )/s*(x) and y ^ (I-t* (Y) ) /s« (Y) - 2.3 

The resulting robust estimator is called the standardized 
suos and differences/ c*(SSD), 
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2 2 2 2 

r*vSSD) = £3*(s+y) --s*{x-y) )/(s^(r^y) +5^ y) ). 2^4 

Any robust location and scale estisators ran be used 

±ut the esfciiSH^terri most often used are TS ^ymsrr^triicrally 

tritfltaed lE^anH £:ad Tariances» {In this paper:? th^ ™r2minq 

percental refers to the total triiaiaed, ane f of the 

paxcent fnom each end of the distribution.) !li±3 estxsator 

pexfornieu well in studies by Devlin^ et al. ^^m^ Denby and 

Laxi5on« 

This approach looks first at the univariate 
distrikxliix)ns independently and then the mc^ior (x+y) and 
lainor C:r-T) axes of the bivariate space. It is^ important to 
note tiiiat an observation which is considered a univariate 
outlier: aight not be considered a bivariate outlier when we 
exaiaine the scatter plot of the data and visa versa* 

Biweig ht Standardized Sums and Differences - r*(BSD) 

In a study of confidence interval robustness for the 
correlation coefficient, Chou (1980) developed a biveiqht 
measure of correlation having the form of the standardized 
sums and differences (2.4) but used the bineight location 
and bar statistics (Hosteller and Tukey, 1977) as the 
estimators of location and scale. This estimator will be 
denoted as r*(BSD)* In the Chou study it demonstrated 
robustness in several situations but its properties as a 
point estimator are unJcnown. 



The biveight stai:istic is 
likelihood estimator of: location. 
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if u > 1 2.6 
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^;^d u = {X - t* ) / c^* 2.7 

i i k-1 k-1 



ezLd k refers to the kth iteration, c is sooe constant and s* 
is a measure of scale«> 

In this study, c - 9 and the median of the absolute 
deviations (EAD) Kas used as the measure of scale. 

The corresponding aeasure of scale is called "bar" for 
robust variance. 



2 2a 
n >• (X-t*) n-u ) 

barCX) = 2.8 

_ 2 2^2 2 

[>* (l-u ) (1--5U ) ][-U>» ) (1-5u ) ] 
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where >• means to sum only those observations where u < 1- 



Tu k e f Correlation • r* (t) 

The Tukey correlation is also based an the guarrier 
square identity where s* is an unfaiassa nstiaator of t:he 
standard deviation and a robust measure c± scale obtained 
from the order statistics of the observations. Let 

s" = X/s*(x) and y" = Y/s*rZ; 2«9 

then 

2 2 

r*(t) = 1/U [(s*(x" + y")) - {s*(r'^ - y")) ]# ^-10 

where s* = [ U77245/ (n (n-1) ) ] > d (i) g (i) • 2.11 

For a sample of size the subscript i represents the rank 
of the observations in ascending order and g(i) represents 
the distance between two adjacent observations; i.e., 

g(i) = X(i + 1) - XCi) for i = 1,,,,^n-l 2,12 

and d(i) = i (n - i) . 2*13 

The Wainer and Thissen study was inconclusive as to the 
value of r*(t) and Chou {1979) has shown it to perform well 
in some situations. 
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Trans £ar^n Spearma n ^ s Rho - r*(S) 
The tramsf armed Spearman^s rho 

r*CS) = 2 sin (3,1 a 159/6) r (S) 2.1« 

is viewed as having robust properties and was included in 
the study of robust correlation estimation by Sainer and 
Thissen with mixed results. The ease of calculation and 
familiarity make it an attractive estimator to use* 

Bivariate Trimsainq - r*(BVT) 

Bivariate trimming refers to a general class of robust 
estimators which trim a specified percent of observations on 
the major and minor axes. Once a given percent of the 
sample observations are identified as outliers, they ar^ 
trimmed and the usual Pearson calculated from the reduced 
sample. The procedures for identifying the outliers differ. 
Devlin et al. used an iterative approach anci Chou (1979) 
used the least squares estimates of location and scale to 
standardize the univariate distributions and then obtained 
the crossproducts. The largest positive and negative 
crossproducts were then trimmed. 

In this study 10% symmetrically trimmed means and 
variances were used for standardizing with 20% symmetric 
trimming of the crossproducts. 
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Bisquare Coefficient • r*{BM 

The bisquare correlation coefficient was introduced by 
Hosteller and lukey 

2 1/2 

r*iB) = + 1 / [1 + bar{I-bX) / b bar (X) ] 2,15 

with the sign of b^ where b is the slope of a bisquare 
regression fit. The bisquare regression fit results from an 
iterative weighted least squares algoritha with the weights 
determined by the relative magnitude of the residuals, 

Baranowski (1980) investigated several aspects of this 
estimator including problems with respect to the preliminary 
bisquare regression fit. A problem of this estimator is tftie 
lack of stability of the estimate given the transposition of 
X and Y. The use of the least squares start and a constant 
of 9 was the prefered solution for the regression fit. To 
solve the problem of transposition, Baranowski proposed an 
averaged bisquare, r*{B*), obtained by fitting both 
regression lines, calculating both correlation estimates and 
taking the average* This averaged bisquare estimate was 
more efficient than the single bisquare in all situations. 
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3. DATA and DESIGN 

Long tailed distributions were simulated by the 
generation of contaminated bivariate normal distriuiitions. 
(This is one of several approaches used in other research 
and is not intended to imply that this is ho^* long tailed 
distributions occur in real situations.) Given a bivariate 
standard normal distribution with a specified correlation, 
observations were replaced with probability bata with 
observations from a bivariate normal distribution yith mean 
equal to zero and variance greater than one- The 
correlation in the bivariate standard normal and the 
contaminating normal distributions were equal to avoid 
ambiguity of what is being estimated. 

The random normal deviates were generated by the 
Harsaglia rectangle-wedge method. The randoia uniform 
numbers used in the method were generated* by the linear 
congiruential method, incorporating the exchange sequence of 
aiiCLaren and Harsaglia (Algorithm M; Knuth, 1969) to break 
the serial correlation between successive nt;?abers^ The 
desired variances and correlations were induced 
algebraically. 

Thirty-two contaminated situations resulting from the 
combination of four parameters were examined: 
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generation of contaminated bivariate normal distriuiitions. 
<This is one of several approaches used in other research 
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^1 

1. population correlatioa: rho = -0^ .3^ ,6 aud .9 
2m samplfj size: n ^ 20 and 40 

3. standard deviation of the contaminants: sigma = 2 
and 3 

4« level of contarainaticn: beta = .10 and •25, 

Eight non-contaminatod situations were also examined for the 
combination of sample size av^d population correlation- 
These bivariate standard normal situations can be 
represented as cases where beta equals zero. 

For economy not all situations were uniquely generated* 
Given sample size and population correlat ..n# the same 
bivariate normal data were used repeatedly across all 
contaminating situationis. This also increases the internal 
consistency of the comparisons of the estimators across the 
various contaminated situations. A similar scheme was used 
by Devlin^ et al. 

In order to generalize the results of any Monte Carlo 
study^ the variability of the statistics should be small. 
Prior to the study, it was decided that the maximnm standard 
error of the mean of the correlation estimators should be 
.01. Since the variability is inversely related to the 
number of replications, regression sampling (Cochran, 1963) , 
a variance reduction technique, was used in this study to 
redvuce the required number of replications given the desired 

o 13 
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precision. The Student's t transf ormatiori of the maximum 
likelihood estimator of the population correlation, given 
the assumptions of zero mean and equal variances, (Kraemer, 
1975) calculated only from the observations from the 
non-contaminated bivariate distributions was used as the 
auxiliary variable. Table 1 gives the number of 
replications used for the various parameter settings- 
Insert Table 1 about here 

4. RESULTS 

To summarize, the following correlation estimators were 
calculated over forty parameter setting: 

1« the Tukey correlation - r* (t) , 

2* the transformation of Spaarman's rho - r*(S), 

3. the hiwedght standardized sums and differences - 
r*(ESD) , 

the standardized sums and differences - r*(SSD) 
using 10% symmetrically trimmed means and 
variances, 

5. bivariate trimming - r*(3VT) - using 10X 
symmetrically trimmed means and variances for 
standardizing X and I and a 20% symmetric trimming 
of the crossproducts. 
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6. the averaged bisquare - r*(B«) - with a least 
squares start and constant of 9 for the bisquare 
regression fit, and 

?• the Pearson correlation coefficient - r(P)* 

Mean square error which takes ato account both the 

variability and bias of an estimator Has the principal means 

of evaluating the robust estimators^ 

2 _ 2 

MSE = S(r*) + (r* - rho) • ' U.I 

The MSE'S for the estimators are presented in Table 2 using 
the two-way analysis by medians (Tukey, 1977) • This 
procedure decomposes the MSE into a grand effect^ the 
effects cf the estimators ( (r*) ) and the effects of the 
contamination (ct) , The values in the subtables are 
residuals- This type of analysis is especially useful when 
the residuals are small relative to the other effects since 
we can focus directly on the effects. Any MSE can be 
reconstructed by summing the three effects and the residual, 

MSE = grand effect + r* ct + residual. 4.2 

Insert Table 2 about here 

All values of MSE were multiplied by 10000 to make the 
tables more readable. For each correlation and sample size 
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there are tyo tables^ one for each of the different 
contaminating standard deviations. Within 3ach subtaLle the 
grand effect is in the upper right, r* effect in the left 
column and contamination effect in the top row. 

The contamination effect is smaller for a standard 
deviation of two than for three and for ten percent 
contamination than for twenty-^f ive percent. The residuals 
for the subtables in Table 2 are not as small as we might 
like. In some cases, the residuals are larger than the 
effects indicating that an additive model is not appropriate 
here and that some sort of interaction might be taking 
place. Mo one estimator is best in all situations and the 
differences between the estimator effects are small, except 
for an occasional situation where one estimator stands out 
as best or worst, such as the large negative effects for a 
correlation of .0. 

Insert Table 3 about here 

. Table 3 contains the grand effects of MSB. MSE varies 
with the level and extent of contamination as well as sample 
size and population correlation. To better understand the 
USE for the various estimators, bias was examined. Table 4 
contains the means, adjusted by the regression sampling 
procedures and multiplied by 1000, for the Pearson as well 
as the six robust estimators. Contamination and sample size 
do not appear to affect bias and since the residuals are 
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generally close to zero, there does not appear to be any 
interaction. Since the maximum standard error of the meaui 
is .01 for all estimators, a difference of less than 20 
between estimators could be considered negligible* Proa 
this table, a bias of r* (S) to underestimate the correlation 
for correlations of • 9 and .6 clearly stands out. The only 
other bias occurs for r*(BVT) for large sample size and 
correlation of • 9. 

Insert Table U about here 

Table 5 contains the grand effects of the adjusted 
means- Generally bias is not a problem except for small 
sample size and a very large correlation vhere there is a 
bias to underestimate the value. 

Insert Table 5 ab^out here 

In addition to MSE and bias, the efficiency of the 
estimators relative to the Pearson is considered, 

efficiency = aSE(r(P)) / MSE(r*). 4.3 

A robust estimator should have smaller HSE than the Pearson 
in all contaminated situations and perform almost as well as 
the Pearson when no contamination is present. Table 6 
contains the efficiency of the six robust estimators. Rhen 
contamination is present, we c^m always do better than the 
Pearson and when the data are from a bivariate normal 
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distribution^ the Pearson is best^ as would be expected. 

Insert Table 6 about here 
5. DISCUSSION 

The following discussion of the indii^idual estimators 
draws on the information in the preceding tables- 

The estimator r*it) performed reasonably well over all 
situations. In more than half of the situations it has the 
smallest or second smal. est USE and never the largest. It 
is better than the Pearson in all situations except for a 
correlation of .9 and a contaminating standard deviation of 
2r but in this case thers^ is not mnch difference between the 
MSE for the two estimators- It also performed well in the 
non-contaminated situations. 

The performance of (S) is related to the value of the 
population correlation. It has the smallest MSE for 
correlations of .0 and -3 and is almost twice as efficient 
as the Pearson in these situations* its performance for 
correlation of .9 is worse than the Pearson- In spite of 
this^ it never has the largest MSE. The relatively poor 
performance for the larger correlations is due to the bias 
of underestimating as mentioned above- These results are 
consistent with those of Sainer and ThiE;.-5en^ which indicated 
that this is the best estimator for small values of 
correlation but not large. 
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The estimator r:'^ (BSD) is never among the best 
estimators and for many situations has the largest HSE. In 
Table 6 we can see that it is often worse than the Pearson^ 
especially for large correlations. 

Overall^ r*(SSD) performed about as well or better than 
the Pearson- It is neither the best nor the worst estimator 
in any situation. 

Inconsistent results occur for r*(BVT), It usually has 
the largest HSE, especially for correlations of ,9 when its 
effect is approximately twice that of the grand effects for 
the subtables in Table 2, For correlations of ,9 and ,61, 
the MSE is larger than that of the Pearson- For 
correlations of .3 and -0 and a contaminating standard 
deviation of three^ r* <BVT) has the second smallest MSE for 
both sample sizes and is 505S more efficient than the 
Pearson. For a standard deviation of two, it performs 
reasonably well for the small samples but not for the large. 
It does not perform well for the bivariate normal 
distributions. 

The averaged bisguare, r*(B«)r is usually the best 
estimator for correlations of -9 and .6 and is among the 
better estimators for the smaller correlations* In terms of 
efficiency r it is always better than the Pearson for the 
contaminated situations and is closje to the Pearson in the 
non -contaminated situations. 
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6* CONCLUSIONS 

In this study, the performance of many of the 
estimators is dependent on the value of the population 
correlation, sample size and/or type of contamination- The 
estimation of very large correlations appears to represent a 
special situation- Two estimators that stand out as not 
beiBg situation specific and demonstrate robust properties 
are the averaged bisguare, r*{B') and Tulcey^ r*(t), 
correlations. The transformed Spearman • s rho is an 
attractive estimator because of the ease of calculation; 
its performance however is dependent on the correlation- 
For small correlations it is clearly the best of all the 
estimators considered in this study but degenerates as the 
correlation increases such that it is among the worst for 
correlation of ^9. of the other three estimators, r*(SSD) 
demonstrated robustness and, given its performance in other 
studies, warrants further investigation- The r*(BSD) did 
not demonstrate robustness in this study and r* (BVT) 
demonstrated limited usefulness- 

The results of this study are not completely comparable 
to some of the previous research since only one type of 
non— Gaussian situation is considered - contaminated normal 
distributions- Devlin et al- used four other types of 
distributions in addition to the contaminated normals- Tvo 
of the leading estimators from their study, r«(BVT), in a 
different form, and r* (SSD) did not perform as well as some 
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of the other robust estimators included in th 
performance of r*(BM and r* (t) with respect t 
of aon-Gaussian distributions is not known- 

. ?• REAL DATA EXAMPLES - 

This was a Monte Carlo study in which the data were 
generated to possess specific distributional properties and 
knonn parameters. It is iaportant to relate the usefulness 
of this type of research to real data where the 
distributions and parameters are unknown and the purpose of 
analysis is estimation. 

Insert Table 7 about here 

Two data sets are presented in table 7. These are 
student scores on midterm and final exams in intermediate 
(example 1) and advanced (example 2) statistical analysis 
classes. Figures 1 and 2 contain the scatter plots for 
these data. 

Insert Figure 1 about here 

In figure 1, there are two observations in the lower 
left corner. The effect of these points is that the Pearson 
wiXl tend to overestimate the relationship between the 
variables. Table 8 contains the correlation estimates. All 
six of the robust estimates are smaller than that of the 
Pearson/ minimizing the effect of the two "outliers". The 
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most ejstreme estimates are r*(B3D) and r*(BVT). 

Insert Table 8 about here 

In figure 2, suspicious or questionable observations 
are on the minor axis and would cause the Pearson to 
underestimate the correlation* The robust estimates are 
larger than that of the Pearson except for r*(S) and 
r*(BVT)« These data might be considered to have fflod<3rately 
high correlation (.7) and we saw above that r*(S) has a 
tendency to underestimate in this situation* The values for 
r*(B») and r*(t), • 703 and -705 respectively, are very 
close. The value for r* (SSD) is most extreme. 

Insert Figure 2 about here 

The above data sets are small but represent the ability 
of the robust estimators to handle different types of 
suspicious points. The different estimators give different 
estimates indicating that they treat observations 
differently. The application of robust estimation to real 
data sets and pxoblems has just begun and it needs to be 
extended and continued along with Mcnte Carlo investigation. 



22 



21 



References 

Andrews^ D.F. A Robust Method for Multiple Linear 

Regression, Technoaetrics ^ 1974^ 523-31 • 

Andrews^ D.F,^ Bickel^ P.J., Hampel^. F^B., Huber^ P.J-^ 
Rogers, ».H* and Inkey, J-H. Robust Estimates 
of Location . Princeton^ N.J.: Princeton University 
Press, 1972-. 

Baranotfski, B.B. A Comparison of Regression Approaches to 

Robust correlation Estimation . Paper presented to 
the American Educational Research Association, 
Boston, Mass. , April, 1980. 

Chou, S.P* Confidence Interval Robustness of Correlation 

Coef f ic ients . Paper presented to the Psychometric 
Society, Iowa City, Iowa, Bay, 1980. 

Chou, S-P« The A pplica tion 2I Trimming Technigues to the 
Robust Estimation of Correlation . Unpublished 
Master •s Thesis, Syracuse University, 1979. 

Cochran, W.G. Sampling Technigues . New York, N.Y.: John 
Hiiey and Sons, Inc., 1963. 

Denby, L. and Larson, W.A. Robust Regression Estimators 
Compared via Monte Carlo. Communications in 
Stati s tics - Theory and Methodc logy, 1977, 
A6, 335-62. 

Devlin, S. , Gnanadesikan, E. and Kettenring^ J.R. Robust 

Estimation and Outlier Detection with Correlation 
Coefficients. Biometrika, 1975, 62^ 531-46. 



23 



Gnanadesikan, and Kettenring, J,B* Eohust Estimates^ 

Residuals aad Outlier Detection with Multiiresponse 

Data, Bj ^offie tcics^ 1972^ 28, 81- 124, 
Gross, A*M, Confidence Interval Robustness with Long-Tailed 

Symmetric Distributions, Journal of the American 

Statistical Association ^ 1976, 71, 409-16, 
Gross, A,t!, Confidence Intervals for Bisguare Regression 

Estimates, Journa l of the AmericP^n Statis tical 

Association ^ 1977, 72, 341-54. 
Hampel, F,R, A General Qualitative Definition of Robustness, 

Annals of Mat hematical Statistics, 1971, 

42, 1887-96, 

Hogg, R,V, Some Observations on Robust Estimation. Journal 
of the American St atistical Association, 1967, 
62, 1179-86. 

Huber, P,J, Robust Statistics: A Review* Annals of 

Mathema tical Statist ics, 1972, 43, 1041-67, 

Huber, P,J, R obust Statistical P rocedures , Philadelphia, Pa; 
Society for Industrial and Applied Mathematics, 
1977b, 

Knuth, D,E, The Art of Computer Programming Vol, 2/ 

Seminuaerical Algorithms , Heading^ Mass: 

Addison-Wesley Publishing C- , 1969, 
Kraemer, H,C. On Estimation and Hypothesis Testing 

Problems for Correlation Coefficients, 

gsvchometrik a, 1975, 40, 473-85, 



24 



.23 

Haronna^ fi,A. Robust M-Estiaiators of Multivariate Location 
and Scatter. Annals of Sta tistics > 1976, 4, 51-67, 

Hosteller, and Tukey, J^H, Data Analysis and Regression. 
Beading, Mass,: Addison-- Sesley Publishing 
Co,, 1977,> 

Harasay, J.O* A Comparative Study of Several Robust Estimates 

of Slope, Intercept and Scale in Linear Regression. 

Journal of the American Statistical Association ^ 

1977, ,72, 608-15, 
Stigler, S.M. Simon Newcomb, Percy Daviell and 

the History of Robust Estimation, :885-1920, 

Journal of the American Statistical Assoc iation^ 

1973, 68, 872-79- 
Tukey, J*W. Explorato ry Data Analysis. Reading, Mass.: 

Addison-Hesley Publishing Co., 1977. 
Hainer, H. Robust Statistics: A 5>urvey and Some 

Prescriptions. Journal of Educatio n al 

Statistics, 1976^ 1, 285-312. 
Hainer, H. and Thissen, D. Three Steps Towards Robust 

Regression-. Psych ometrika^ 1976, UJ, 9-34. 



EKLC 



25 



TABLE 1 
Number of fieplications 





n = 


20 


n = 


40 




net 


ct 


net 


ct 


rho 










.9 


100 


100 


100 


100 


.6 


300 


300 


100 


100 


.3 


500 


600 


200 


300 


.0 


500 


600 


200 


300 



net - no contamination 
ct - contamination 



ERIC 



2S 



TABLE 2 

MSE for r*( ) times 10000 

n = 20 n « 40 

Sigma 2 3 2 





beta 


,10 


,23 




,10 


,25 




,10 


,25 




,10 


,25 


rho 








it 












n 






.9 


34 


-4 


4 


DL 


Q 


0 
0 






1 


22 


-4 


4 






n 


1 


-12 


1 


c 


-2 


1 . 


0 


-5 


1 


-1 


r*(BSD) 


-1 


0 


0 


7 


-2 


3 


1 


0 


0 


1 


-3 


3 


r*(S) 


. 28 


1 


-1 


35 


3 


-2 


9 


0 


0 


10 


-1 


0 


r*(t) 


-2 


0 


1 


-6 


-1 


1 


n 
U 


1 


- Ji 




4 


-4 


r*(BVT) 


64 


-3 


4 


95 


-0 


0 




.1 


1 

Ji 


48 


-8 


7 


r*(SSD) 


0 


1 


-2 


-13 


3 


-z 


n 
u 


u 


Ji 


-1 


0 


0 


.6 


371 


-10 


10 


A At 

44 Z 


-Zo 


^0 


1 ^7 


-10 


10 


192 


-22 


21 


*- J 




7 


.7 


-16 


13 


-14 


-26 


15 


'16 


-38 


22 


-21 


r^CBSD) 


7 


-8 


9 


64 


-61 


60 


-2 


0 


1 


32 


-22 


22 


r*(S) 


-7 


-1 


1 


-40 


4 


-3 


2 


1 


-1 


-4 


0 


ft 
u 


r*(t) 


-35 


2 


-2 


-8 


xO 


1 n 


- 


1 9 


-1 


-29 


25 


-25 


r*(BVT) 


44 


1 


-1 


Z3 


-5 




00 


-2 


2 


39 


-17 


18 


r*(SSD) 


7 


-3 


2 


8 


-11 


1 1 




u 


A 

u 




0 


0 


.3 


543 


-14 


14 


Db9 


-4/ 


Aft 
40 




-9 


g 


302 


-27 


28 


r*(B') 


-3 


-11 


12 


11 


-16 


16 


1 


-8 


8 


28 


-13 


1 A 

14 


T*fBSP) 


27 


-1 


2 


39 


-9 


9 


-1 


-1 


2 


13 


-4 


5 


r*(S) 


-35 


4 


-3 


-106 


24 


-23 


-23 


2 


-1 


-59 


13 


-12 


r*(t) 


-18 


-5 


5 


13 


9 


-9 


-9 


-5 


5 


12 


1 


0 


r*(BVT) 


3 


7 


-6 


-58 


23 


-23 


23 


9 


-10 


-21 


24 


-24 


r*(SSD) 


9 


0 


-1 


-11 


-11 


12 


6 


3 


-3 


-11 


0 


0 


.0 


644 


-5 


5 


789 


-29 


30 


304 


-17 


16 


372 


-36 


36 




9 


0 


1 


29 


-13 


13 


-10 


-2 


3 


20 


-8 


7 


r*(BSD) 


5S 


-4 


4 


31 


-23 


22 


10 


-8 


8 


23 


-18 


18 


r*(S) 


-38 


2 


-2 


-136 


13 


-14 


-22 


1 


-1 


-68 


8 


-8 


r*(t) 


-9 


1 


0 ' 


8 


24 


-25 


-14 


-1 


1 


2 


11 


-10 


r*(BVT) 


-13 


X 


0 


-126 


16 


-16 


32 


6 


-6 


-3.6 


16 


-15 


r*(SSD) 


27 


-5 


5 


-8 


-28 


. 28 


16 


3 


-3 


-3 


-7 


7 



* estimator effect ** contamination effect 



erJc 



TABLE 3 

Grand Effects of MSE (times 10000) 





n = 


20 


n - 


40 


sigma 


2 


3 


2 


3 


rho 










.9 


34 


52 


U 


22 


.6 


371 


HH2 


152 


192 


.3 


51*3 


659 


250 


302 


.0 


6HH 


789 


304 


372 



ERIC 



28 



27 



TABLE H 
Adjusted Ucians (times 1000) 

n = 20 n = 40 



sxgma 
beta 



rho 
-9 



r(P) 

r*(B») 

r*{BSD) 

r*(S) 

r*(t) 

r*(BVT) 

r*(SSD) 



-6 



r*(B«) 

i:*(BSD) 

r*(S) 

r*(t) 

r*(BVT) 

r*(SSD) 





- 10 


.25 


. 10 


.25 




. 10 


.25 


. 10 


.25 












* 










889 


n 


-1 


0 


-7 


893 


2 


3 


-2 


-3 


1 


1 


-1 


3 


0 


0 


-2 


1 


-3 


6 


0 


0 


' 0 


0 


3 


0 


-1 


-1 


2 


2 


3 


1 


0 


0 


-4 


-3 


0 


-1 


3 


-5 


-25 


1 


2 


0 


0 


-16 


0 


1 


0 


0 


2 


-1 


-1 


2 


1 


0 


-2 


0 


0 


6 


5 


-1 


7 


1 


-11 


39 


8 


2 


-1 


-26 


-3 


-3 


0 


0 


4 


-3 




-1 


3 


0 


578 


5 


0 


0 


-10 


598 


1 


2 


-2 


-1 


4 


-1 


1 


0 


4 


3 


-3 


-2 


1 


3 


10 


-1 


0 


-1 


2 


3 


-1 


0 


-2 


5 


-3 


3 


0 


0 


-6 


-10 


7 


0 


0 


-10 


-16 


0 


1 


-2 


0 


-18 


4 


-3 


3 


-14 


7 


-3 


0 


0 


5 


0 


0 


-3 


4 


0 


0 


4 


3 


-4 


-6 


5 


9 


1 


-1 


-17 


-n 


1 


-1 


2 


-3 


-12 


0 


0 


-2 


1 



.3 





280 


2 


0 


-1 


-6 


295 


1 


2 


0 


-1 


r(P) 


-6 


-1 


3 


-9 


0 


3 


0 


-1 


-2 


2 


r*(BM 


1 


0 


1 


-2 


2 


2 


-1 


0 


-4 


2 


r*(BSD) 


5 


-1 


0 


3 


0 


-4 


1 


0 


0 


-5 


r*(S) 


-7 


0 


0 


0 


-2 


-10 


5 


-1 


1 


-4 


r*Ct) 


0 


-1 


1 


-5 


0 


2 


0 


0 


-2 


2 


r*(B7T) 


-3 


0 


^1 


0 


-6 


0 


4 


0 


1 


-9 


r*(SSD) 


1 


0 


-1 


1 


-1 


-2 


0 


-1 


4 


0 



0 


-3 


0 


-1 


1 


0 


r(P) 


3 


-1 


0 


-1 


1 


r*(B») 


3 


1 


-2 


5 


-1 


r*(BSD) 


0 


5 


1 


-1 


-3 


r*(S) 


-5 


-1 


-1 


0 


0 


r* (t) 


-1 


0 


1 


0 


0 


r*(BVT) 


-5 


0 


1 


-1 


2 


r*(SSD) 


0 


2 


-2 


2 


-4 



2 


4 


-4 


3 


-6 


0 


2 


-3 


9 


-3 


0 


0 


0 


-1 


-1 


-2 


0 


0 


-1 


1 


-1 


0 


-1 


1 


0 


-1 


1 


-2 


5 


-2 


3 


-2 


1 


0 


1 


1 


-1 


4 


-2 


0 



* estimator effect 



** contamination effect 
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TABLE 5 

Grand Effects of Adjusted Means 



n = 20 n = UO 



rho 






.9 


.889 


. 893 


.6 


-578 


.598 


.3 


.280 


.295 


.0 


-.003 


. 002 



ERIC 



30 . 



29 



TABLE 6 

USE Efficiency Relative to the Eearson 



n = 20 n = 40 



sigma 




2 




3 


1 




2 




3 


1 


beta 


.10 


. 25 


.10 


. 25 


0 


.10 


. 25 


.10 


. 25 


0 










rho 


= .9 












r*(B«) 


1.00 


1.17 


1.26 


1.52 


.95 


1.22 


.98 


1.98 


1.08 


1,00 


r*«BSD) 


.90 


1.12 


,85 


.97 


.88 


1.07 


.84 


1,69 


.73 


loOO 


r*(S) 




. 6U 


. 50 


. 73 


. 55 


. 68 


. 56 


1.03 


. 60 


. 76 


r*(t) 


,93 


1.11 


1.12 


1.24 


.91 


1.09 


.98 


1,47 


1. 12 


1.00 


r*(BVT) 




.39 


,31 


. 42 


, 27 


. 29 


. 25 


, 48 


u 27 


. 23 


r*(SSD) 


.86 


1. 16 


1.22 


1.53 


.75 


1,09 


.88 


1-66 


. 86 


.81 










rho 


= .6 












r*(B«) 


1,07 


1.10 


1.30 


1 .28 


o96 


1.12 


1.10 


1.58 


1.24 


. 97 


r*(BSD) 


.99 


.95 


1.28 


,95 


.96 


1,06 


.82 


1,36 


.71 


.87 


r*(S) 


1,0 1 


1.00 


1 • 41 




Q it 
O 1 






1 . 4o 


on 




c*(t) 


1-09 


1.09 


1-29 


1.25 


.99 


1. 14 


1.06 


i.a? 


1. 20 


.96 


r*(BVT) 


. 88 


. 89 


1« 23 


1 «t V ,3 


.70 


, 83 


. 65 


1 . 26 


. 70 


^ Q 

. DO 


r*{SSD) 


,98 


,96 


1,30 


1- 15 


,83 


1-03 


.80 


1,40 


. 88 


.81 










rho 


~ -3 














1,09 


1.08 


1,tt2 


1. 25 


,96 


1, 14 


1. 13 


1,55 


1.27 


.97 




1,02 


1o05 


1,3a 


1.21 


.88 


1. 12 


1, 17 


1.58 


1. 36 


.92 


r*(S) 


1.13 


1» 18 


1.63 


1. 5<J 


.88 


S<9 2( .' 


1,29 


1,96 


1.82 


.93 


r*(t) 


1.11 


1. 13 






.96 


1, 18 


1.19 


1,56 


1.38 


-98 


r*(BVT) 


KOU 


1-11 


1.49 


1. 


.77 


, 98 


1,12 


1.62 


1.65 


.80 


r*(SSD) 


1.05 


1.08 




1..30 


„77 


1o07 


1,16 


1.71 


1.48 


.82 










rfao 


= -0 












r*(B«) 


1.07 


1.07 


1.33 


1. 14 


.93 


1, 16 


1.10 


1.56 


1. 18 


1.00 


r«(BSD) 


1.00 


.99 


1.34 


1. 13 


,89 


1, 10 


1.02 


1.60 


1. 15 


.99 


r*(S) 


1,1t| 


1. 15 


1.62 


1. 47 


,93 


1,20 


1.16 


1-97 


1.55 


.97 


r*(t) 


1.09 


1.10 


1.30 


1.23 


.98 


1. 17 


1.12 


1-56 


1.29 


1.03 


r*<BVT) 


1.10 


1. 10 


1.59 


1.46 


.88 


. 98 


-99 


1.72 


1.44 


.81 


r*(SSD) 


1.04 


UG3 


1.43 


1. 17 


.83 


1.04 


1.04 


1.67 


1.25 


.90 
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TABLE 7 
Real Data Examples 





DATA 


1 

1 


DATA 


2 


Oi3S« 


Y 




If 


X 


1 






U7 


40 




7 1 




38 

-J sJ 


38 




7ft 


^7 


36 


42 


ii 




1 1 
1 i 




42 


c 
3 


An 






LI 


O 


f J 








/ 


^ o 


RA 


13 


26 


A 
o 






17 


A 

w 


Q 


7*? 






42 


1 n 


O u 


S3 


37 


45 


1 1 

1 1 


-J ji' 


51 


14 


5 


1 9 


7 1 


51 


38 


34 


1 1 
1 J 




U1 


31 


34 


1 U 


S7 


S9 


25 


5 








32 


38 


1 w 


79 


47 


24 


0 


17 


37 


44 


38 


22 


1 A 
1 o 




58 






19 


61 


Ii3 






20 


66 


44 






21 


53 


48 






22 


58 


55 






23 


54 


50 






24 


51 


42 






25 


80 


59 






26 


33 


13 






27 


80 


58 






28 


53 


43 






29 


67 


52 
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TABLE 8 

Correlation Estimates for Seal Data 





DATA 1 


DATA : 


rCP) 


.638 


.683 


r*(B«) 


.1168 


-703 


r*(BSD) 


-396 


.710 


r*(S) 


.5113 


.667 


r*(t) 


.585 


.705 


r*{BVT) 


.352 


.664 


r*(SSD) . 


.621 


.812 
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Y I 
65 + 



PLOT OF Y*X 



FIGURE 1 
EXAMPLE. I 
SYMBOL USED IS * 



60 + 



55 + 



50 + 



45 + 



AO + 



35 + 



30 + 



25 + 



20 + 



15 ^ 



10 + 



* 



33 37 41 45 49 53 57 61 65 69 73 77 



EKLC 



34 



33 



Y 


1 


45 


+ 


44 


+ 


43 


. + 


42 


+ 


41 


4- 


40 


+ 


39 


+ 


38 


+ 


37 




36 


+ 


35 


+ 


34 


+ 


33 




32 


+ 


31 


+ 


30 


+ 


29 


+ 


28 


+ 


27 


+ 


26 


+ 


25 


.+ 


24 


+ 


23 




22 


+ 


21 


+ 


20 


+ 


19 


+ 


18 




17 


+ 


16 


+ 


15 


+ 


14 




13 


+ 


12 


+ 


1 1 


+ 


10 




9 


+ 


8 


+ 


7 


+ 


6 


+ 


5 


+ 


4 


+ 


3 


+ 


2 


+ 


1 


+ 


0 


+ 



FIGURE 2 
EXAMPLE 2 
PLOT OF Y*X SYMBOL USED IS * 



* 



* 
* 



1 3 1 7 2 1 2 5 29 33 37 4 1 45 

X 

note: 1 OBS HIDDEN 
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